skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Emile-Geay, Julien"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. This paper outlines a strategy to teach scientific analysis using Python, integrated with open science publishing concepts. 
    more » « less
    Free, publicly-accessible full text available July 10, 2026
  2. Abstract Paleoclimate reconstructions are increasingly central to climate assessments, placing recent and future variability in a broader historical context. Paleoclimate reconstructions are increasingly central to climate assessments, placing recent and future variability in a broader historical context. Several estimation methods produce plumes of climate trajectories that practitioners often want to compare to other reconstruction ensembles, or to deterministic trajectories produced by other means, such as global climate models. Of particular interest are “offline” data assimilation (DA) methods, which have recently been adapted to paleoclimatology. Offline DA lacks an explicit model connecting time instants, so its ensemble members are not true system trajectories. This obscures quantitative comparisons, particularly when considering the ensemble mean in isolation. We propose several resampling methods to introduce a priori constraints on temporal behavior, as well as a general notion, called plume distance, to carry out quantitative comparisons between collections of climate trajectories (“plumes”). The plume distance provides a norm in the same physical units as the variable of interest (e.g. °C for temperature), and lends itself to assessments of statistical significance. We apply these tools to four paleoclimate comparisons: (1) global mean surface temperature (GMST) in the online and offline versions of the Last Millennium Reanalysis (v2.1); (2) GMST from these two ensembles to simulations of the Paleoclimate Model Intercomparison Project past1000 ensemble; (3) LMRv2.1 to the PAGES 2k (2019) ensemble of GMST and (4) northern hemisphere mean surface temperature from LMR v2.1 to the Büntgen et al. (2021) ensemble. Results generally show more compatibility between these ensembles than is visually apparent. The proposed methodology is implemented in an open-source Python package, and we discuss possible applications of the plume distance framework beyond paleoclimatology. 
    more » « less
  3. Tropical and subtropical hydrological systems are important to water resource management. To improve understanding of these dynamical systems, it is useful to probe their relationship with relevant forcings. Historically, insolation is believed to be a major driver of these dynamics, testable using speleothem stable oxygen isotope records, whose growth regions and precise chronologies enable detailed investigations of tropical/subtropical hydrology. Here we present a systematic analysis of long speleothem records examining the global relationship between and insolation across different timescales and regions. Our analysis reveals that the relationship between speleothem and insolation varies significantly by region, with no latitude, season, or periodicity of insolation bearing global relevance. We demonstrate that, when comparing speleothem to insolation curves, seasonal influence often cannot be distinguished from physical lags without additional constraints from modeling or theory. Most notably, we identify a previously unrecognized asymmetry whereby coherence in the precessional (19–23 kyr) band frequently collapses during glacial periods while maintaining power in the 100 kyr band. This suggests a fundamental reorganization of the hydrological cycle and its response to orbital forcing during glacial periods. Our results indicate that comparing speleothem to single insolation curves oversimplifies the complex relationship between orbital forcing and hydroclimate variability, highlighting our framework's utility for comprehensively exploring these interactions. These findings advance understanding of how mid‐to‐low latitude hydrology responds to external forcing–knowledge that may prove valuable as we face unprecedented CO2‐driven climate change. 
    more » « less
    Free, publicly-accessible full text available August 1, 2026
  4. Abstract Despite increased Atlantic hurricane risk, projected trends in hurricane frequency in the warming climate are still highly uncertain, mainly due to short instrumental record that limits our understanding of hurricane activity and its relationship to climate. Here we extend the record to the last millennium using two independent estimates: a reconstruction from sedimentary paleohurricane records and a statistical model of hurricane activity using sea surface temperatures (SSTs). We find statistically significant agreement between the two estimates and the late 20th century hurricane frequency is within the range seen over the past millennium. Numerical simulations using a hurricane-permitting climate model suggest that hurricane activity was likely driven by endogenous climate variability and linked to anomalous SSTs of warm Atlantic and cold Pacific. Volcanic eruptions can induce peaks in hurricane activity, but such peaks would likely be too weak to be detected in the proxy record due to large endogenous variability. 
    more » « less
  5. Abstract. Climate field reconstruction (CFR) refers to the estimation of spatiotemporal climate fields (such as surface temperature) from a collection of pointwise paleoclimate proxy datasets. Such reconstructions can provide rich information on climate dynamics and provide an out-of-sample validation of climate models. However, most CFR workflows are complex and time-consuming, as they involve (i) preprocessing of the proxy records, climate model simulations, and instrumental observations; (ii) application of one or more statistical methods; and (iii) analysis and visualization of the reconstruction results. Historically, this process has lacked transparency and accessibility, limiting reproducibility and experimentation by non-specialists. This article presents an open-source and object-oriented Python package called cfr that aims to make CFR workflows easy to understand and conduct, saving climatologists from technical details and facilitating efficient and reproducible research. cfr provides user-friendly utilities for common CFR tasks such as proxy and climate data analysis and visualization, proxy system modeling, and modularized workflows for multiple reconstruction methods, enabling methodological intercomparisons within the same framework. The package is supported with extensive documentation of the application programming interface (API) and a growing number of tutorial notebooks illustrating its usage. As an example, we present two cfr-driven reconstruction experiments using the PAGES 2k temperature database applying the last millennium reanalysis (LMR) paleoclimate data assimilation (PDA) framework and the graphical expectation–maximization (GraphEM) algorithm, respectively. 
    more » « less
  6. Abstract Paleoclimate records can be considered low‐dimensional projections of the climate system that generated them. Understanding what these projections tell us about past climates, and changes in their dynamics, is a main goal of time series analysis on such records. Laplacian eigenmaps of recurrence matrices (LERM) is a novel technique using univariate paleoclimate time series data to indicate when notable shifts in dynamics have occurred. LERM leverages time delay embedding to construct a manifold that is mappable to the attractor of the climate system; this manifold can then be analyzed for significant dynamical transitions. Through numerical experiments with observed and synthetic data, LERM is applied to detect both gradual and abrupt regime transitions. Our paragon for gradual transitions is the Mid‐Pleistocene Transition (MPT). We show that LERM can robustly detect gradual MPT‐like transitions for sufficiently high signal‐to‐noise (S/N) ratios, though with a time lag related to the embedding process. Our paragon of abrupt transitions is the “8.2 ka” event; we find that LERM is generally robust at detecting 8.2 ka‐like transitions for sufficiently high S/N ratios, though edge effects become more influential. We conclude that LERM can usefully detect dynamical transitions in paleogeoscientific time series, with the caveat that false positive rates are high when dynamical transitions are not present, suggesting the importance of using multiple records to confirm the robustness of transitions. We share an open‐source Python package to facilitate the use of LERM in paleoclimatology and paleoceanography. 
    more » « less
  7. Abstract Paleoclimate reconstructions are now integral to climate assessments, yet the consequences of using different methodologies and proxy data require rigorous benchmarking. Pseudoproxy experiments (PPEs) provide a tractable and transparent test bed for evaluating climate reconstruction methods and their sensitivity to aspects of real-world proxy networks. Here we develop a dataset that leverages proxy system models (PSMs) for this purpose, which emulates the essential physical, chemical, biological, and geological processes that translate climate signals into proxy records, making these synthetic proxies more relevant to the real world. We apply a suite of PSMs to emulate the widely-used PAGES 2k dataset, including realistic spatiotemporal sampling and error structure. A hierarchical approach allows us to produce many variants of this base dataset, isolating the impact of sampling bias in time and space, representation error, sampling error, and other assumptions. Combining these various experiments produces a rich dataset (“pseudoPAGES2k”) for many applications. As an illustration, we show how to conduct a PPE with this dataset based on emerging climate field reconstruction techniques. 
    more » « less
  8. Abstract Speleothem oxygen isotope records offer unique insights into Asian Monsoon evolution, with their precise chronologies used to identify abrupt climatic events. However, individual records are sometimes used to draw broad conclusions about global climate, without considering the dynamical context in which they exist. We present a robust framework for assessing the regional significance, and hence the potential global significance, of paleoclimate events, using the proposed Meghalayan age onset (associated with the “4.2 ka event”) as a case study. Analyzing 14 well‐dated speleothem oxygen isotope records from the SISAL v3 database and recent literature, we investigate the regional coherency of rapid shifts in Asian paleohydrology, which is the regional center of action for the proposed event, over the Holocene. Three robust methods fail to detect spatially coherent variability consistent with a 4.2 ka event across Asia, either because none exists or because it is of insufficient magnitude. In contrast, the 8.2 ka event is expressed in most records that resolve it. The absence of a clear isotopic excursion across this data set suggests that the “4.2 ka megadrought” was not global, with important implications for archeology and geochronology. This casts doubt on the proposal that the 4.2 ka event marks the onset of a new geologic age. We do, however, observe support for a gradual isotopic enrichment between 3.9 and 3.6 ka, followed by partial recovery—consistent with the “Double Drying” hypothesis and possibly related to changes in El Niño‐Southern Oscillation variability. 
    more » « less
  9. Abstract Studying past climate variability is fundamental to our understanding of current changes. In the era of Big Data, the value of paleoclimate information critically depends on our ability to analyze large volume of data, which itself hinges on standardization. Standardization also ensures that these datasets are more Findable, Accessible, Interoperable, and Reusable. Building upon efforts from the paleoclimate community to standardize the format, terminology, and reporting of paleoclimate data, this article describes PaleoRec, a recommender system for the annotation of such datasets. The goal is to assist scientists in the annotation task by reducing and ranking relevant entries in a drop-down menu. Scientists can either choose the best option for their metadata or enter the appropriate information manually. PaleoRec aims to reduce the time to science while ensuring adherence to community standards. PaleoRec is a type of sequential recommender system based on a recurrent neural network that takes into consideration the short-term interest of a user in a particular dataset. The model was developed using 1996 expert-annotated datasets, resulting in 6,512 sequences. The performance of the algorithm, as measured by the Hit Ratio, varies between 0.7 and 1.0. PaleoRec is currently deployed on a web interface used for the annotation of paleoclimate datasets using emerging community standards. 
    more » « less
  10. null (Ed.)
    Abstract We use theNorthern Hemisphere Tree-RingNetwork Development (NTREND) tree-ring database to examine the effects of using a small, highly-sensitive proxy network for paleotemperature data assimilation over the last millennium. We first evaluate our methods using pseudo-proxy experiments. These indicate that spatial assimilations using this network are skillful in the extratropical Northern Hemisphere and improve on previous NTREND reconstructions based on Point-by-Point regression. We also find our method is sensitive to climate model biases when the number of sites becomes small. Based on these experiments, we then assimilate the real NTREND network. To quantify model prior uncertainty, we produce 10 separate reconstructions, each assimilating a different climate model. These reconstructions are most dissimilar prior to 1100 CE, when the network becomes sparse, but show greater consistency as the network grows. Temporal variability is also underestimated before 1100 CE. Our assimilation method produces spatial uncertainty estimates and these identify treeline North America and eastern Siberia as regions that would most benefit from development of new millennial-length temperature-sensitive tree-ring records. We compare our multi-model mean reconstruction to five existing paleo-temperature products to examine the range of reconstructed responses to radiative forcing. We find substantial differences in the spatial patterns and magnitudes of reconstructed responses to volcanic eruptions and in the transition between the Medieval epoch and Little Ice Age. These extant uncertainties call for the development of a paleoclimate reconstruction intercomparison framework for systematically examining the consequences of proxy network composition and reconstruction methodology and for continued expansion of tree-ring proxy networks. 
    more » « less